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Abstract 

In this paper we further investigate the well-studied problem of finding a perfect matching 
in a regular bipartite graph. The first non-trivial algorithm, with running time 0(mn), dates 
back to Konig's work in 1916 (here m = nd is the number of edges in the graph, In is the 
number of vertices, and d is the degree of each node). The currently most efficient algorithm 
takes time O(m), and is due to Cole, Ost, and Schirra. We improve this running time to 

0(min{m, - — j^}); this minimum can never be larger than 0(n 175 Vhm). Wc obtain this 
improvement by proving a uniform sampling theorem: if we sample each edge in a d-regular 
bipartite graph independently with a probability p = 0( nl J^ n ) then the resulting graph has a 
perfect matching with high probability. The proof involves a decomposition of the graph into 
pieces which are guaranteed to have many perfect matchings but do not have any small cuts. 
We then establish a correspondence between potential witnesses to non-existence of a matching 
(after sampling) in any piece and cuts of comparable size in that same piece. Karger's sampling 
theorem for preserving cuts in a graph can now be adapted to prove our uniform sampling 
theorem for preserving perfect matchings. Using the 0{m^/n) algorithm (due to Hopcroft and 
Karp) for finding maximum matchings in bipartite graphs on the sampled graph then yields the 
stated running time. We also provide an infinite family of instances to show that our uniform 
sampling result is tight up to poly-logarithmic factors (in fact, up to In 2 n). 

1 Introduction 

A bipartite graph G = (U,V,E) with vertex set UL)V and edge set E C U x V is said to be regular 
if every vertex has the same degree d. We use m = nd to denote the number of edges in G and n 
to represent the number of vertices in U (as a consequence of regularity, U and V have the same 
size). Regular bipartite graphs have been the subject of much study. Random regular bipartite 
graphs represent some of the simplest examples of expander graphs [12] ■ These graphs are also 
used to model scheduling, routing in switch fabrics, and task-assignment problems (sometimes via 
edge coloring, as described below) [HE]. 

A regular bipartite graph of degree d can be decomposed into exactly d perfect matchings, a fact 
that is an easy consequence of Hall's theorem Finding a matching in a regular bipartite graph 
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is a well-studied problem, starting with the algorithm of Konig in 1916, which is now known to run 
in time 0(mn) [TT]. The well-known bipartite matching algorithm of Hopcroft and Karp [8] can be 
used to obtain a running time of 0{rri\fn). In graphs where d is a power of 2, the following simple 
idea, due to Gabow and Kariv [7], leads to an algorithm with 0{m) running time. First, compute 
an Euler tour of the graph (in time 0(m)) and then follow this tour in an arbitrary direction. 
Exactly half the edges will go from left to right; these form a regular bipartite graph of degree d/2. 
The total running time T(m) thus follows the recurrence T(m) = 0(m) + T(m/2) which yields 
T(m) = 0(m). Extending this idea to the general case proved quite hard, and after a series of 
improvements (eg. by Cole and Hopcroft [5], and then by Schrijver [13] to 0(md)), Cole, Ost, and 
Schirra [6] gave an 0(m) algorithm for the case of general d. 

The main interest of Cole, Ost, and Schirra was in edge coloring of general bipartite graphs of 
maximum degree d, where finding perfect matchings in regular bipartite graphs is an important 
subroutine. Finding perfect matchings in regular bipartite graphs is also closely related to the 
problem of finding a Birkhoff von Neumann decomposition of a doubly stochastic matrix [3l [16] . 

In this paper we present an algorithm for finding a perfect matching in a regular bipartite graph 
that runs in time 0(min{m, n °^ nn })- It is easy to see that this minimum can never be larger than 
0(n L75 Vlnn). This is a significant improvement over the running time of Cole, Ost, and Schirra 
when the bipartite graph is relatively dense. We first prove (Theorem 12.11 in section [2]) that if we 
sample the edges of a regular bipartite graph independently and uniformly at rate p = 0( n 5 n ), 
then the resulting graph has a perfect matching with high probability. The resulting graph has 
0(mp) edges in expectation, and running the bipartite matching algorithm of Hopcroft and Karp 

2.5 1 

gives an expected running time of 0( n d ). Since we know this running time in advance, we 

2.5 i 

can choose the better of m and - — -^-^ in advance. It is worth noting that uniform sampling can 
easily be implemented in 0(1) time per sampled edge assuming that the data is given in adjacency 
list format, with each list stored in an array, and assuming that logn bit random numbers can be 
generated in one time step0. 

We believe that our sampling result is also independently interesting as a combinatorial fact. 
The proof of our sampling theorem relies on a sequential decomposition procedure that creates 
a vertex-disjoint collection of subgraphs, each subgraph containing many perfect matchings on 
its underlying vertex set. We then show that if we uniformly sample edges in each decomposed 
subgraph at a suitably chosen rate, with high probability at least one perfect matching survives 
in each decomposed subgraph. This is established by using Karger's sampling theorem J9[ [10] in 
each subgraph. An effective use of Karger's sampling theorem requires the min-cuts to be large, 
a property that is not necessarily true in the original graph. For instance, G could be a union of 
two disjoint ci-regular bipartite graphs, in which case the min-cut is 0; non-pathological examples 
are also easy to obtain. However, our serial decomposition procedure ensures that the min-cuts 
are large in each decomposed subgraph. We then establish a 1-1 correspondence between possible 
Hall's theorem counter-examples in each subgraph and cuts of comparable size in that subgraph. 
Since Karger's sampling theorem is based on counting cuts of a certain size, this coupling allows 
us to claim (with high probability) that no possible counter-example to Hall's theorem exists in 
the sampled graph. On a related note, Benczur [2] presented another sampling algorithm which 
generates 0(n Inn) edges that approximate all cuts; however this sampling algorithm, as well 
as recent improvements [15|. [T3] take Cl(m) time to generate the sampled graph. Hence these 

Even if we assume that only one random bit can be generated in one time step, the running time of our algorithm 
remains unaltered since the Hopcroft-Karp algorithm incurs an overhead of y/n per sampled edge anyway. 
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approaches do not directly help in improving upon the already known 0(m) running time for 
finding perfect matchings in (i-regular bipartite graphs. 

The sampling rate we provide may seem counter-intuitive; a superficial analogy with Karger's 
sampling theorem or Benczur's work might suggest that sampling a total of O(nlnn) edges should 
suffice. We show (Theorem 14.11 section H]) that this is not the case. In particular, we present a 
family of graphs where uniform sampling at rate o( js" ) results in a vanishingly low probability 
that the sampled subgraph has a perfect matching. Thus, our sampling rate is tight up to factors 
of 0(ln 2 n). This lower bound suggests two promising directions for further research: designing 
an efficiently implementable non-uniform sampling scheme, and designing an algorithm that runs 
faster than Hopcroft-Karp's algorithm for near-regular bipartite graphs (since the degree of each 
vertex in the sampled subgraph will be concentrated around the expectation). 

2 Uniform Sampling for Perfect Matchings: An Upper Bound 

In this section, we will establish our main sampling theorem stated below. We will then show 
in Section [3] that this theorem immediately yields an 0(n L75 v / kin) time algorithm for finding a 
perfect matching in regular bipartite graphs. 

Theorem 2.1 There exists a constant c such that given a al-regular bipartite graph G(U,V,E), 
a subgraph G' of G generated by sampling the edges in G uniformly at random with probability 
p = cn ^ n contains a perfect matching with high probability. 

Our proof is based on a decomposition procedure that partitions the given graph into a vertex- 
disjoint collection of subgraphs such that (i) the minimum cut in each subgraph is large, and (ii) 
each subgraph contains £l(d) perfect matchings on its vertices. We then show that for a suitable 
choice of sampling rate, w.h.p. at least one perfect matching survives in each subgraph. The union 
of these perfect matchings then gives us a perfect matching in the original graph. We emphasize 
here that the decomposition procedure is merely an artifact for our proof technique. Note that the 
theorem is trivially true when d is O(Vnmn). So in what follows, we assume that d is £l(Vn Inn). 

2.1 Hall's Theorem Witness Sets 

Let G(U, V, E) be a bipartite graph. We shall use the following notation. For a graph G and a set 
of vertices W we denote the number of edges crossing the boundary of W in G by 5g(W). Also, 
we denote the vertex set of G by V{G). 

A pair (A, B) with A C U and B C V is said to be a left relevant pair to Hall's theorem if 
\A\ > \B\. Similarly, a pair (A,B) with A C U and B C V is said to be a right relevant pair to 
Hall's theorem if \A\ < \B\. 

Given a left relevant pair (A, B), we denote by E{A, B) the set of edges in E Pi (A x (V \ B)). 
Similarly, given a right relevant pair (^4, B), we denote by E(A, B) the set of edges in Ef]((U\A) x 
B). We refer to the set E(A,B) as a witness edge set if (A, B) is a left or right relevant pair. By 
Hall's theorem (see, for instance, [1]), to prove Theorem 12.11 it suffices to show that w.h.p. in the 
sampled graph G', at least one edge is chosen from each witness set. We will focus on a sub-class 
of relevant pairs, referred to as minimal relevant pairs. A left relevant pair (A, B) is minimal if 
there does not exist another left relevant pair (A',B') with A' C A and E(A',B') C E(A,B). 
Minimal right relevant pairs are similarly defined. A witness edge set corresponding to a minimal 
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left relevant pair or a minimal right relevant pair is called a minimal left witness set or a minimal 
right witness set, respectively. If a graph G has a perfect matching, every minimal witness set must 
be non-empty. It also follows that any subgraph of G that includes at least one edge from every 
minimal witness set must have a perfect matching. 

A key idea underlying our proof is a mapping from minimal witness sets in G to distinct cuts 
in G. In particular, we will map each minimal left witness set E(A, B) to the cut 5 G (A U B). The 
theorem below shows that this is a one-to-one mapping. The analogous theorem holds for minimal 
right witness sets. 

Theorem 2.2 Let G(U, V, E) be a bipartite graph that has at least one perfect matching. If (A, B) 
and (A',B') are minimal left relevant pairs in G with E(A,B) ^ E(A',B'), then 5 G (A UB) ^ 
Sg(A'L)B'). 

Proof: Assume by way of contradiction that there exist minimal left relevant pairs (A, B) and 
(A', B') in G with E(A, B) / E(A', B') but 5 G (A\JB) = S G (A'UB'). Then the following conditions 
must be satisfied for any edge (u, v) G E : 

Al. If u G (A\ A') U (A' \ A) then v £ (B \ B') U (B' \B). To see this, assume w.l.o.g. that 
u G A\A', and then note that if v G BOB', then (u,v) G 5 G (A'uB') but (u,v) ^5 G (Al)B). 
A contradiction. Similarly, if v G V\(BUB'), then (u,v) G 5 G (AuB) but (u,v) G" S G (A'UB'). 
A contradiction. 

A2. Hue (An A') then v £ (B \ B') U (B' \B). To see this, consider w.l.o.g. that v G (B \ B'). 
Then (u,v) G 5 G (A' U B') but (u,v) G" S G (A U B). A contradiction. 

In what follows, we slightly abuse the notation and given any (not necessarily relevant) pair 
(C, D) with C C U and D C V, we denote by ^(C, D) the set of edges in E n (C x (V \ £))). As an 
immediate corollary of the properties Al and A2, we now obtain the following containment results: 

Bl. E(A \A',B\ B') C E(A, B). This follows directly from property Al above. 
B2. E(A f]A',Bf] B') C E(A, B). This follows directly from property A2 above. 

We now consider three possible cases based on the relationship between A and A' , and establish 
a contradiction for each case. 

Case 1: A n A' = 0. By property Al, if u G A U A' then v € B U B'. In other words, there are 
no edges from A U A' to vertices outside 5 U Since |A U A'\ = \A\ + > \B\ + \B% this 
contradicts our assumption that G has at least one perfect matching. 

Case 2: A = A'. For any edge (u, v) with u £ A, property A2 shows that v £ (B \ B') U (£' \ B). 
Then B) = E(A',B'). A contradiction. 

Case 3: AnA'^V) and A / A'. Assume w.l.o.g. that A \ A' ^ 0. Since |^| > |J3|, it must be 
that either \A\A'\ > \B\B'\ or \AnA'\ > \BnB'\. If \A\A'\ > |J3 \ J3'|, then (A \ A', B \ B') is a 
left relevant pair, and by Bl, it contradicts the fact that (A, B) is a minimal left relevant pair. If 
\A n A'\ > \B n then (^4 ni',Bn S') is a left relevant pair set, and by B2, it contradicts the 
fact that (A, B) is a minimal left relevant pair. 
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2.2 A Decomposition Procedure 

Given a (i-regular bipartite graph on n vertices, we will first show that it can be partitioned into 
k = 0(n/d) vertex disjoint graphs Gi(U\, Vi, E{), G^C^, V2, E2), Vy., E^) such that each 

graph Gi satisfies the following properties: 

PI. the size of a minimum cut in Gi(Ui,Vi, Ei) is strictly greater than a = 

P2. \5g{Ui U Vi)\ < d/2 (hence Gi contains at least d/2 edge-disjoint perfect matchings). 

The decomposition procedure is as follows. Initialize Hi = G, and set i = 1. 

1. Find a smallest subset Xi C V{Hj) such that < 2a. If no such set exists, then 
the decomposition procedure terminates. 

2. Define Gi to be the subgraph of Hi induced by the vertices in Xi. Also, let Mj denote the 
number of edges in the cut ^(Xj). 

3. Define -ffj+i as Hi with vertices from Xi removed. 

4. Increment i, and go to step (1). 

We now prove the following properties of the decomposition procedure. 

Theorem 2.3 The decomposition procedure outlined above satisfies properties PI and P2. 

Proof: We start by proving that property PI is satisfied. Suppose that there exists a cut (C,D) 
in Gi of value less than a, i.e. C U D = Xi and 5d{C) = Sg % (D) < a. We have |<5ff,(C) \ 
(5(2, (C) I + \5Hi(D) \ Sq^D) I < 2a by the choice of Xi in (1). Suppose without loss of generality 
that |<5flv(C) \ 5Gi(C)\ < a. Then <5^(C) < 2a and C C Xi, which contradicts the choice of Xi as 
the smallest cut of value at most 2a in step (1) of the procedure. 

It remains to show that \5c{Ui U Vi)\ < d/2 for all i. In order to establish this property, it 
suffices to show that Ym=i M i ^ d / 2 (recall that Mj = \S Hi ( x i)\)- 

We prove the following statements by induction on k, the number of decomposition steps: 

1. |^ fc UVfc| > 2d; 
2- EtiM t <d/2; 
3. k < n/d. 

Base: k = 1 Since 2a = < d/2, we have M\ < d/2. It remains to show that G\(JJ\, V\, E\) 
has at least 2d vertices. Consider any vertex u £ U\. Let j < d/2 be the number of edges in 
$h 1 (Ui U V\) that are incident on vertex u. Then u must have exactly (d — j) neighbors in 
V\. Since |<5f/i {U\ U Vi)| < d/2, at least one vertex among the neighbors of u in V\ must have 
all its d neighbors inside U\. Thus \U\\ > d. Similarly, we can show that \V\\ > d. 
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Inductive step: k — ► k + 1 Suppose that the £;-th step has been executed and the algorithm 
has not terminated yet. Since k < n/d by the inductive hypothesis, we have X/i=i -^i — 
(n/rf) (2a) = (n/d) (|Q < d/2. Consider the cut (X k ,H k \ X k ) of # fe . It follows from the 

previous estimate that \5jj k (X k )\ > ^(A^)! — d/2. Hence, we conclude as in the base case 
that \X k \ > 2d and \H k \ X k \ > 2d. Since at every decomposition step j < k at least 2d 
vertices were removed from the graph, we have k + 1 < n/d. 



2.3 Proof of Theorem I2TT1 

We now argue that if the graph G is obtained by uniformly sampling the edges of G with probability 
p — q (Is.™) ; then w.h.p. G' contains a perfect matching. 

It suffices to show that in each graph Gi obtained in the decomposition procedure, every minimal 
witness set is hit w.h.p. in the sampled graph (that is, at least one edge in each minimal witness 
set is chosen in the sampled graph). This ensures that at least one perfect matching survives inside 
each Gi. A union of these perfect matchings then gives us a perfect matching of G in the sampled 
graph G' . 

Fix a graph Gi(Ui,Vi, E{). Let (A,B) be a left or a right relevant pair in Gi. Using the fact 
that our starting graph G is (i-regular, we get 

\S G (AUB)\ <2\E(A,B)\-d. 

Let mA,'mB denote the number of edges in G that connect nodes in A, B respectively to nodes 
outside Gi. Then 

\8 G .{AUB)\ < 2\E(A,B)\-d-m A -m B . 

By propertyE2l since \5 G (Ui U V-)\ < d/2, it follows that \E(A,B) n Ei\ > \E(A,B)\ - d/2. Also, 
by definition, \E(A,B) n Ei\ > \E(A,B)\ — uia — itib- Combining, we obtain: 

\5 Gi (AuB)\ < 2\E(A,B)nEi\-d/2. 

Thus the set E(A, B) n Ei contains at least half as many edges as the the cut S Gi (A U B). We will 
now utilize the following sampling result due to Karger |10j : 

Theorem 2.4 |10] Let Gi be an undirected graph on at most n vertices, and let k be the size of a 
minimum cut in Gi. There exists a positive constant c such that for any e £ (0, 1), if we sample 
the edges in Gi uniformly with probability at least p = c (^r) , then every cut in Gi is preserved to 
within (1 ± e) of its expected value with probability at least 1 — l/n^ 1 ). 

Thus the sampling probability needed to ensure that all cuts are preserved close to their expected 
value, is inversely related to the size of a minimum cut in the graph. We now show use the theorem 
above to prove that at least one perfect matching survives in each graph Gi when edges are sampled 
with probability specified in Theorem 12.11 

By Property IPll we know that the size of a minimum cut in Gi is at least a = d 2 /An. Fix an 
e G (0, 1). The theorem above implies that if we sample edges in G with probability p = 9 (^) , 



6 



then for every relevant pair (A,B), w.h.p. the sampled graph contains (lzize)p\SG i (AUB)\ = Q(lnn) 
edges from the set Sq^A U B). 

Note that the set 5^ (AL)B) is not a Hall's theorem witness edge set. However, by Theorem 12, 1| 
we know that for every left (right) minimal witness edge set E(A, B)(~]Ei, we can associate a distinct 
cut, namely 5cj i (AuB), of size at most twice \E(A, B)(~)Ei\. We now show that this correspondence 
can be used to directly adapt Karger's proof of Theorem 12.41 to claim that every witness edge set 
in Gi is preserved to within (1 ± e) of its expected value. We remind the reader that the proof of 
Karger's theorem is based on an application of union bound over all cuts in the graph. In particular, 
it is shown that the number of cuts of size at most (5 times the minimum cut size is bounded by 
n 2 ^. On the other hand, for the sampling rate given in Theorem 12.4^ we can use Chernoff bounds 
to claim that the probability that a cut of size (3 times the minimum cut deviates by (1 ± e) from 
its expected value is at most l/nP^ 3 '. The theorem follows by combining these two facts. 

Within any piece of the decomposition, let q be the number of cuts of size i and let Wi be the 
number of minimal witness sets of size i. We know by the correspondence argument above that 
every Hall's theorem minimal witness set of size i corresponds to a cut of size at most 2i, and at 
most two minimal witness sets (one left and one right) correspond to the same cut. 

Now, given a sampling probability p, the probability that none of the edges in some minimal 
witness set are sampled is at most £V Wi(l — p)\ which is at most £V 2cj(l — p) % l 2 . Therefore the 
probability that there is no matching in this piece can be at most twice the expression used in 
Karger's theorem to bound the probability that there exists a cut from which no edge is sampled 
when the sampling rate is q, where 1 — q = (1 — p) 1 ^ 2 , or p = 2q — q 2 . Hence, it is sufficient to 
use a sampling rate which is twice that required by Karger's sampling theorem to conclude that a 
perfect matching survives with probability at least 1 — 1 /rf 1 ^ . 

Even though we don't use it in this paper, the following remark is interesting and is worth 
making explicitly. The remark follows from the additional observation that Karger's proof [10] of 
theorem 12 .41 uses Chernoff bounds for each cut, and these bounds remain the same if we use minimal 
witness sets which are at least half the size of the corresponding cuts, and then sample with twice 
the probability. 

Remark 2.5 There exists a positive constant d such that for any e G (0, 1), if we sample the edges 
in G uniformly with probability at least p = c' (j^r)> then every minimal witness set in every piece 
Gi is preserved to within (1 ± e) of its expected value with probability at least 1 — l/n™ 1 ' . Here 
a = d 2 /{An), as defined before. 

Putting everything together, the sampled graph G' will have a perfect matching w.h.p. as long as 
we sample the edges with probability p > for a sufficiently large constant c, thus completing 
the proof of theorem 12.11 We have made no attempt to optimize the constants in this proof (an 
upper bound of 12 follows from the reasoning above). In fact, in an implementation, we can 
use geometrically increasing sampling rates until either the sampled graph has a perfect matching, 
or the sampling rate becomes so large that the expected running time of Hopcroft and Karp [8] 
algorithm is £l(m). 
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3 A Faster Algorithm for Perfect Matchings in Regular Bipartite 
Graphs 



We now show that the sampling theorem from the preceding section can be used to obtain a faster 
randomized algorithm for finding perfect matchings in ci-regular bipartite graphs. 

Theorem 3.1 There exists an 0(min{m, n rf lnn }) expected time algorithm for finding a perfect 
matching in a d-regular bipartite graph with 2n vertices and m = nd edges. 

Proof: Let G be a <i-regular bipartite graph with 2n vertices and m = nd edges. If d < n 3//4 Vlnn, 
we use the 0(m) time algorithm of Cole, Ost, and Schirra [6] for finding a perfect matching in a 
(i-regular bipartite graph. It is easy to see that m < i n this case. 

Otherwise, we sample the edges in G at a rate of p = cn ^ n for some suitably large constant 
c (c = 48 suffices by the reasoning from the previous section), and by Theorem 12.14 the sampled 
graph G' contains a perfect matching w.h.p. The expected number of edges, say m', in the sampled 
graph G' is 0( IL -^). We can now use the algorithm of Hopcroft and Karp [8] to find a maximum 
matching in the bipartite graph G 1 in expected time 0(m' \/n). The sampling is then repeated if no 
perfect matching exists in G' . This takes 0( n rf lnn ) expected running time. Hence, the algorithm 
takes 0(min{m, " 2 ' } nn }) expected time. ■ 

Note that by aborting the computation whenever the number of sampled edges is more than 
twice the expected value, the above algorithm can be easily converted to a Monte-Carlo algorithm 
with a worst-case running time of 0(min{m, - J rf lnn }) and a probability of success = 1 — o(l). 
Finally, it is easy to verify that the stated running time never exceeds 0(n 1 ' 75 %/ln~n). 



4 Uniform Sampling for Perfect Matchings: A Lower Bound 

We now present a construction that shows that the uniform sampling rate of Theorem l2.1l is optimal 
to within a factor of (9(ln 2 n). As before, for any graph G the graph obtained by sampling the 
edges of G uniformly with probability p is denoted by G'. 

Theorem 4.1 Let d(n) be a non- decreasing positive integer valued function such that for some 
fixed integer uq, it always satisfies one of the following two conditions for all n > uq: (a) d(n) < 
-y/n/lnn, or (b) \jnj In n < d(n) < n/hxn. Then there exists a family of d(n) -regular bipartite 
graphs G n with 2n + o{n) vertices such that the probability that the graph G' n , obtained by sampling 
edges ofG n with probability p, has a perfect matching goes to zero faster than any inverse polynomial 
function in n if p = o(l) when d{n) satisfies condition (a) above, and if 

P = °((d(n)) 2 lnn) 
when d(n) satisfies condition (b) above. 

Proof: Note that the theorem asserts that essentially no sampling can be done when d(n) < 
\Jn/\a.n. We shall omit the dependence on n in d(n) to simplify notation. 

Define if M = (U, V, E) , < k < d, as a bipartite graph with \U\ = \ V\ = d such that k vertices 
in U (respectively V) have degree (d — 1) and the remaining vertices have degree d. We will call 



S 



the vertices of degree (d — 1) deficient. Clearly, for any < k < d, the graph H ^ exists: starting 
with a d-regular bipartite graph on 2d vertices, we can remove an arbitrary subset of k edges that 
belong to a perfect matching in the graph. In the following construction, we will use copies of 
as building blocks to create our final instance. In doing so, only the set of deficient vertices in a 
copy of H^ will be connected to (deficient) vertices in other copies in our construction. 



We now define a <i-regular bipartite graph G n . Let 7 



d' 2 Inn 



(note that 7 < d since 



d < n/lnn). We choose W = - , kj = 7 for 1 < j < W, and kw = d — ^{W — 1) < 7. We also 
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define K(n) = [Inn] if d(n) > \Jnj In n and K(n) = otherwise. 

The graph G n consists of K(n) ■ W copies of H^ k > that we index as {#i,j}i<i<x(n),i<j<vv- The 
subgraph Hi is a copy of H^ kj \ where kj is as defined above. Note that the sum of the number of 
deficient vertices over each of the parts of Hij, 1 < j < W, equals d for all fixed i. Moreover, the 
number of deficient vertices in Hij is the same for all i when j is held fixed. 

We now introduce two distinguished vertices u and v and add additional edges as follows: 

1. For every 1 < i < K(n) and for every 1 < j < W, all deficient vertices in part V of H^j are 
matched to the deficient vertices in part U of Hi + \ j (that is, we insert an arbitrary matching 
between these two sets of vertices); 

2. All deficient vertices in part U of H\ ,• for 1 < j < W are connected to u; 

3. All deficient vertices in part V of H K ^ j for 1 < j < W are connected to v. 

Essentially, we are connecting the graphs Hi j for fixed j in series via their deficient vertices, 
and then connecting the left ends of these chains to the distinguished vertex u and the right ends 
of the chains to the distinguished vertex v. 

We note that the graph G n constructed as described above is a d-regular bipartite graph with 
2dK(n)W + 2 = 2n + o(n) vertices. 

Consider the sampled graph G' n . Suppose G' n has a perfect matching M. In the matching 
M, if u is matched to a vertex in part U of H[ ■ for some 1 < j < W, then there must be a 
vertex in part V of H[ ■ that is matched to a vertex in part U of H' 2 j. Proceeding in the same 
way, one concludes that for every i, 1 < i < K{n) there must be a vertex in part V of H[ ■ that 
is matched to a vertex in part U of H' i+l -. Finally, vertex v must be matched to a vertex in 
part V of H' K ^ ■. This implies that the sampled graph G' n can have a perfect matching only if 
at least one edge survives in G' n between every pair of adjacent elements in the sequence below: 
u ~ *■ H hj ~ *■ ^2,j H K ^ n - ) _ l j — > H K{jl) j — > v. 

Now suppose that we sample edges uniformly with probability p. It follows from the construction 
of G n that for any fixed j, the probability that at least one edge survives between every pair of 
adjacent elements in the sequence u — > H\j — > H2J -Hx(n)-ij — *• -^x(n)j — > v is equal to 

(i-a-pf-) Kl " ,+l <( P k,f^. 

Hence, the probability that at least one such path survives in G' n is at most 

K(n)+1 



W p max kj 

i<j<w J 
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by the union bound. 

When d(n) < sjnj Inn, we have 7 = 1, W = d, kj = 1 and K(n) = \n/d 2 ~\. So the bound 
transforms to 

W p K W +1 = dpW d ^ +1 , (1) 

which goes to zero faster than any inverse polynomial function in n when p = o(l) since K{n) = 
\n/d 2 ] = n(lnn). 

d 2 lnr 



When d > \Jnj Inn, we have kj < 7 where 7 
the bound becomes 



, W 



W{jn f(n)+1 = d (p7) [lnnl + l 



and K(n) = [Inn]. Hence, 



(2) 



which goes to zero faster than any inverse polynomial function in n when p = o( , a " )■ This 
completes the proof of the theorem. 

■ 

The construction given in Theorem 14.11 shows that the sampling upper bound for preserving a 
perfect matching proved in Theorem 12. II is tight up to a factor of 0(ln 2 n). 
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